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ABSTRACT 

We present and discuss a method to identify substructures in combined angular- 
redshift samples of galaxies within Clusters. The method relics on the use of Discrete 
Wavelet Transform (hereafter DWT) and has already been applied to the analysis of 
the Coma cluster (Gambera et al. 1997). The main new ingredient of our method with 
respect to previous studies lies in the fact that we make use of a 3D data set rather 
than a 2D. We test the method on mock cluster catalogs with spatially localized sub- 
structures and on a N-body simulation. Our main conclusion is that our method is able 
to identify the existing substructures provided that: a) the subclumps are detached in 
part or all of the phase space, b) one has a statistically significant number of redshifts, 
increasing as the distance decreases due to redshift distortions; c) one knows a priori 
the scale on which substructures are to be expected. We have found that to allow 
an accurate recovery we must have both a significant number of galaxies (s=s 200 for 
clusters at z> 0.4 or about 800 at z< 0.4) and a limiting magnitude for completeness 
rriB = 16. 

The only true limitation to our method seems to be the necessity of knowing a priori 
the scale on which the substructure is to be found. This is an intrinsic drawback of the 
method and no improvement in numerical codes based on this technique could make 
up for it. 
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1 INTRODUCTION 

A large fraction of Clusters of galaxies shows a very complex 
structure, either in the spatial and velocity distribution of 
the galaxies themselves (e.g. West & Bothun 1990), or in 
the X-ray maps (e.g. Sarazin 1988), or in both. Although 
the substruct ures a re often clearly visible in the optical (e.g. 
Mellier et al. (1988)) and on the X-ray maps (Grebenev et al. 
1995) it is not always easy to make a quantitative analysis of 
the amount of substructure present. During the past years, 
many methods borrowed from statistical theory have been 
applied to attempt determining the number and properties 
of the subgroups. Most of these methods have been applied 
to 2D angular position galaxy maps or ID line-of-sight ve- 
locity distributions. In a recent paper (Gambera et al. 1997, 
hereafter GPAB) we have tried to make use of the full 3D 
information available from a complete sample of galaxy pro- 
jected positions and line-of-sight velocities for Abell 1656 
(the Coma cluster) in order to recover and determine some 
morphological properties of the substructure. In this paper 
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we will explain and test the method introduced in GPAB on 
simulated cluster catalogs and we will determine the mini- 
mal requirements which have to be satisfied in order to apply 
it with confidence. 

Our method is based on the use of the Discrete Wavelet 
Transform (hereafter DWT), a technique originally intro- 
duced in turbulence studies (see e.g. Farge 1992). Sometimes 
the Continuos Wavelet Transform (hereafter CWT) has been 
applied to characteri ze su bstructure in galaxy clusters (e.g. 
Escalera & Mazure (1 



1986 



hereafter EM), but it has been 
convincingly shown that the two methods are unrelated to 
each other (i.e. CWT is not the continuous limit of DWT), 
and that CWT is unsuitable to characterize the struct ure of 
clusters (|Fang fc Pando 1997|; [Pando fc Fang 1996[). The 



DWT admits a comple te, compact supported orthogonal 
bases (Daubechies 1988), and for this reason it represents 



a suitable tool for the analysis of finite samples of objects, 
like galaxy catalogs. 

One point of our 3D DWT method deserves some clarifi- 
cation, i.e. the fact that we are using heterogeneous data, 
namely the projected position on the sky and the line-of- 
sight velocity. Although many authors have used the distri- 
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bution function of the projected velocity in their statistical 
analysis of substructure, one can argue that in the highly 
nonlinear environment of a cluster both redshift distortions 
and relaxation effects destroy the relationship between line- 
of-sight (l.o.s.) velocity and distance, thus making appar- 
ently meaningless the use of the l.o.s. velocity in a context 
where one tries to isolate substructures in real space. Sup- 
pose however that the substructure we are trying to identify 
is made of spatially separated clumps whose inner velocity 
dispersion is smaller than the average l.o.s. distance among 
the clumps themselves. The velocity of a galaxy can be writ- 
ten as: v g = Hr + v pBC , where H is the Hubble constant and 
v pec measures the deviations induced by random velocities 
within the cluster due to relaxation effects and systematic 
infall motions. The quantity v pec is a stochastic variable, de- 
scribed by some probability distribution, but we kno w from 
observations (e.g. the EN ACS survey, Mazure et al. (1996)) 



and fro m simulations (Natarajan, Hjorth & van Kampen 
199? |) t[ iat its moments filtered on cluster scales are finite. 
So if: Hr >> v p( , c the Hubble term dominates the l.o.s. ve- 
locity, and this quantity can then be used to characterize 
substructures whose relative distances within the cluster are 
larger than v pec /H. 

In order to test our 3D DWT method and to determine 
reliable confidence limits, we have simulated a total num- 
ber of 20 clusters changing the following parameters: 1. the 
distance of the cluster from the observer; 2. the number of 
clumps making up the cluster; 3. the number of galaxies 
inside the cluster and the individual clumps; 4. the mean 
distance separating different clumps in the same cluster; 5. 
the completeness of our catalogues. The analysis has been 
performed using a parallel code that allows a rapid struc- 
ture detecti on and morphological ana lysis in a 3-D set of 
data points ( Pagliaro fc Becciani 1998 ). We have also anal- 
ysed the output of a N-body simulation, where substructure 
has a more hierarchical distribution. 

The plan of the paper is as follows: in §2 we describe the 
method of analysis, in §3 we describe how we engendered 
our mock catalogues, which are then analysed in §4. In §5 
we discuss the statistical robustness of our results and finally 
in §6 we report our conclusions. 



2 THE METHOD OF ANALYSIS 
2.1 Method overview 

The 3D DWT method can be divided into three steps. In 
the first one computes the wavelet matrices on all the scales 
investigated: we adopt the "a trous" algorithm to perform 
this task , as described by (^eg^^94) (hereafter L94). The 
same matrices are computed for the data to be analyzed and 
on a random distribution in the same region of space and 
on the same grid as the real data. On these latter matrices 
we calculate the threshold corresponding to a certain confi- 
dence level in the structure detection. 

The second step is the segmentation analysis. The aim of this 
analysis is to have all the connected pixels with a wavelet 
coefficients greater than the threshold labelled with an inte- 
ger number different for every single structure. This allows 
a rapid identification of connected regions which are needed 
as input for the third step. 



The third and last step of the method is the computation of 
a morphological parameter for every structure singled out 
and of a mean morphological parameter for each scale. 

In the following of this section we describe the serial 
version of our algorithm. A detailed description of the par- 
allel implementation has been given in mor e technical papers 
(IPagliaro & Becciani 19981; IPagliaro 19981). 



2.2 The wavelet transform 

Generally speaking, a wavelet transform is the decomposi- 
tion of basis functions obtained by translation and dilation 
of a particular function localized in both physical and fre- 
quency space. A characteristic features of this kind of anal- 
ysis is that it allows a simultaneous study of both positional 
and scaling properties. Although our method is devised to 
deal with 3-D data, for the sake of simplicity we describe 
here the 1-D version. The generalization to the 3-D case is 
straightforward. 

For a one- dimensional function f(x) the wavelet trans- 
form is a linear operator that can be written as: 



w(s,t) = {f\i>) 

2 f + °°mr(^)dx 



(i) 



where s(> 0) is the scale on which the analysis is performed, 
t £ K is the spatial t ransla t ion p arameter and ip is the the 
Grossmann-Morlet ((1984), (1987)) analyzing wavelet func- 
tion 



(2) 



that is spatially centered around position t and on a scale 
s. The wavelet function ^>( li0 )(x) is called mother wavelet. 
It generates the other wavelet function ^( B ,t)( x ), s > 1- We 
follow L94 in the choice of the mother wavelet in order to 
use the a trous algorithm in the following. 



(3) 



where cj> is the cubic centred B-spline function defined by: 



■ 41a; - 



•6b|' 



■4|a: + l| 3 + |a: + 2| 3 



12 



(4) 



In order to use the d trous algorithm we choose a set of 
scales which are powers of two: s = 2 r and the first scale 
always corresponds to the size of 1 pixel. The scale s in 
this kind of analysis may be considered as the resolution. In 
other words, if we perform a calculation on a scale so, we 
expect the wavelet transform to be sensitive to structures 
with typical size of about so and to be able to reveal them. 
The first step of the wavelet matrices computation is the 
evaluation of the coefficient c(0). This is defined as: 

c(0,t) = {f(x)\4>(x-t)) (5) 

On the other scales the coefficients c are given by: 

(6) 



Since the function d> satisfies: 



1 

2 l + 1 ' 



71 = 2 

n— — 2 



(7) 
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P[Wthres(s) < I0„„(s)] < £ 



(10) 




x (degrees) 

Figure 1. Projected positions of galaxies in a typical mock cat- 
alogue obtained as described in §3.1 



for i > 0, we can write: 

2 

c(s,t) — S h(n)c(s — 



l,t + n2 r ~ 1 ) 



(8) 



where h(n) — j^C^-n, C™ being the binomial coefficients. 
Using to eqs.|l] and |^ we can write the following expression 
for the wavelet coefficients on the various scales: 



w(s,t) — c(s,t) — c(s — l,t) 



(9) 



The wavelet analysis associates to each pixel a real num- 
ber, which represents the smoothed local density contrast at 
a given scale. At the end of this part our result is a set of 
matrices of wavelet coefficients; one matrix for each scale 
investigated. 

Even if the histogram of the wavelet coefficients may sug- 
gest the presence of substructure, revealed by asymmetries 
between the positive and negative parts of the probability 
distributions (see e.g. figs. 2-4 in GPAB), this kind of infor- 
mation is only visual and not easily quantifiable and spa- 
tially localizable. 



2.3 Thresholding 

The thresholding is made on the wavelet coefficient his- 
togram. For an ideal flat background, the wavelet transform 
coefficients should be equal to zero. The existence of struc- 
tures at a given scale gives wavelet coefficient with large pos- 
itive values. It is however quite obvious that this is strictly 
true only in an ideal case: a random distribution may have 
non-zero coefficients even if there are no structures, due to 
statistical fluctuations. Moreover, the statistical behaviour 
of the wavelet coefficient is complex due to the correlation 
among nearby pixels. 

In order to decide whether a structure detected on a given 
scale we need to fix a significance threshold. We choose it 
through a classical decision rule. We calculate the wavelet 
coefficients w ra n(s) for each scale of our analysis, for a ran- 
dom distribution in the same region of space of our data 
and on the same grid. Then we calculate the probability 
P[w(s) < w ran (s)] and choose the value Wthres(s) so that: 



Our threshold on the scale s is the value Vthres = w t h res (s). 
For example, a choice for the value of e of: 



e = 0.001 



(11) 



ensures a 99.9% confidence level in the structure detection. 
We have also explored the consequences of an alternative 
choice for the treshold, i.e. to fix it in terms of a given num- 
ber of standard deviations from the variance, but the final 
results are insensitive to this choices. 



2.4 Structure numbering by means of 
segmentation 

The second step of our analysis is the determination of con- 
nected pixels over a fixed threshold (segmentation, Rosenfeld 
the numbering of the selected structures and their 



morphological analysis. 

The segmentation and numbering consists in the exam 
of the wavelet coefficients matrix; all the pixels associated 
with a wavelet coefficient greater than the selected threshold 
are labelled with an integer number. All other pixel labels 
are set equal to zero. Then, the same label is associated with 
all the pixels connected in a single structure, in a sequential 
way. So, the first structure individuated bears the label T' 
and so on. We also compute the volume and surface of each 
structure found. 



2.5 The morphological parameter 

In order to perform a morphological analysis we have to 
introduce a morphological parameter that quantifies the 
sphericity of the structures. We choose the parameter: 



V 2 



(12) 



where V is the volume and S is the surface, as in L94, and 
K(s) is a parameter that depends on the scale of the anal- 
ysis. We want L(s) to have the following behaviour: zero 
for very filamentary structures and 1 for spherical ones. 
This may be achieved putting K = 367r, but only for those 
scales not affected by the granular nature of the analysis. 
We choose the value 367r only for the scales s = 2 r pixels 
with r > 2. For the smallest scales the constant 367r is not 
adequate, since we are close to the grid resolution and the 
geometry of the substructures cannot be spherical. Since we 
want to consider as spherical a one-pixel structure, we adopt 
the values: 



K{2 r 



216 ifr = 0, 1 
36-7T otherwise 



(13) 



Then, for every detection threshold we calculate the values: 



(Hs)) = £ 



N obj 



(14) 



where N bj is the number of objects detected at scale s. 



3 SIMULATING CLUSTERS OF GALAXIES 
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Table 1. Number of distinct substructures above the significance 
threshold detected by our method. All lengths are expressed in 
h~ 1 Mpc. The first six columns report, respectively: Name of the 
simulated cluster, Magnitude for 100 % completeness, Number 
of clumps making up the cluster, Distance to the observer, Total 
number of galaxies in the cluster, Mean separation among clumps. 
In the last four columns the number of substructures detected at 
scales 0.5 -r- 4h _1 Mpc. The clusters marked by an asterisk have 
been reshuffled 10 + 10 times. 
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Figure 2. The wedge diagram corresponding to the projected 
positions of fig.l 



3.1 Mock catalogs 

Our model clusters are engendered by randomly assembling 
a number of clumps, each characterized by the position of 
their centers. As coordinates we choose two angular coordi- 
nates and the line-of-sight velocity: (0, <f>, Vi os ). Given the po- 
sition of the center of each clump (9 c i,4>ci,Vci), galaxies are 
distributed about the center according to a gaussian distri- 
bution in real space of width ay. Velocities are then specified 
by adding to the Hubble flow a random component drawn 
from a maxwellian distribution with a given dispersion a c i. 
Each simulated cluster is then made of a collection of these 
clumps. This method allows a partial superposition of the 
clumps. 

To each galaxy in the cluster we also attribute an abso- 
lute magnitude drawn randomly from a Schechter luminos- 
ity function: 



-L/L 



dL 



(15) 



The values for m* = — 2.5Log(L*), <f)* and a are appropriate 



to t he c entral region of the Coma cluster (Biviano et al 

i99ej)7] 

We show in Fig. 1 the projected positions of galaxies in 
a typical mock catalogue obtained with these prescriptions, 
and in Fig. 2 the corresponding wedge diagram. It is evident 
from these plots that substructure in these mock catalogues 
is made of independent groups which overlap each other. 



3.2 N-body simulation 

We also applied our test to the result of a N-body simulation, 
where density distribution is clearly hierarchical. We have 
considered a 10 3 h~ 3 Mpc 3 box extracted from a 16-million 
particles simulation of a 50 h~ 1 Mpc box (Antonuccio- 
Delogu et al., in preparation). Using SKID, a standard gravi- 
tationally bound groups finder (Stadel et al., in preparation) 
we have found 5 groups on a scale of 500 h" 1 kpc within this 
box. 



4 TESTING SUBSTRUCTURE RECOVERY 

4.1 Is the DWT capable to recover substructure? 

In order to assess the capability of our code to recover sub- 
structures we simulate five clusters of galaxies at a fixed 
distance made of an increasing number of well separated 
clumps: from only 1 clump to 15. Clusters are identified by 
the names C13 (containing 1 clump, 528 galaxies), C14 (3 
clumps, 1224 galaxies), CI (5 clumps, 1512 galaxies), C15 (8 
clumps, 3456 galaxies) and C16 (15 clumps, 8750 galaxies). 
The mean separation between two clumps inside a cluster 
is set equal to 2.7 7i _1 Mpc. As one can see from Table [l] 
substructures are well detected. Unfortunately, an intrinsic 
drawback of the DWT is the unability to distinguish the 
typical scale of substructures (Biajoui, private communica- 
tion). Substructures "created" by our simulations are on a 
typical scale of about 2h~ x Mpc, so we can detect them 
looking at the right scale. 



4.2 Dependence on the distance from the observer 

We simulate four different clusters of galaxies with distances 
ranging from 8 to 68 ft -1 Mpc with step 20 ft -1 Mpc. Clus- 
ters are identified by the names CI (set at a distance 68 h~ x 
Mpc from the observer), C2 (48 h" 1 Mpc), C3 (28 h' 1 Mpc) 
and C4 (8 ft" 1 Mpc). All clusters are made of 1512 galaxies 
divided in 5 clumps with mean separation 2.7 h^ 1 Mpc. 

As one can see from Table |^, the recovery does not de- 
pend on the distance from the observer. However at small 
distances the Hubble flow and the peculiar velocities of the 
galaxies becomes comparable, and we must take into ac- 
count this effect. We have repeated the analysis on each out 
of 20 realizations of the four clusters obtained by randomly 
"reshuffling" the original simulated catalogues with two dif- 
ferent method: 1. the first 10 reshuffiings were made by redis- 
tributing randomly the redshifts among the galaxies while 
keeping the angular coordinates fixed (see Table [^); 2. the 
remaining 10 reshuffiings were made varying the redshifts 
(namely the value of cz) of a typical value of the galaxies 
peculiar velocities in a random direction, while keeping the 
angular coordinates fixed (see Table |^) . 

In the first case, the average values of the number of 
structures found is always smaller than the one found in 
the original simulated clusters. This test, already performed 
in GPAB on a catalogue of clusters from the Coma clus- 
ter with the same result, strengthens our confidence on the 
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Table 2. Tests on mock clusters: varying distances. Symbols are 
as in Table 1, but varying the mean distance from the observer. 



N 


m B 


N c 


do 


N s 


<k 


0.5 


1 


2 


4 


CI* 


oo 


5 


68 


1512 


2.7 


12 


6 


5 


2 


C2* 


CO 


5 


18 


1512 


2.7 


13 


7 


5 


2 


C3* 


CO 


5 


28 


1512 


2.7 


13 


7 


i 


2 


C4* 


oo 


5 


8 


1512 


2.7 


13 


6 


5 


2 



Table 3. Tests on mock clusters: varying richness. Symbols as in 
Table 1. 
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physical significance of the structures detected in Coma. In 
the second case the number of structures is nearly the same 
as in the original catalogues. We "followed" some galaxies 
labelling them in the reshufflings and we have seen that the 
galaxies labelled are always inside the same substructure 
for the clusters CI, C2, C3 and reshuffled samples, but may 
change substructure for the case of the cluster C4. This hap- 
pens because in this latter case the peculiar velocities and 
the Hubble flow become comparable. However it is worth 
noticing that, although the galaxies may "leap" from one 
substructure to another, the number of these is always nearly 
the same. 

4.3 Dependence on the number of galaxies 

We simulate six clusters of galaxies made of an increasing 
number of data points (galaxies) ranging from a minimum 
of 118 to a maximum of 14712. Clusters are identified by 
the names C5 (made of 118 galaxies), C6 (330 galaxies), C7 
(528 galaxies), CI (1512 galaxies), C8 (7032 galaxies) and 
C9 (14712 galaxies). All these clusters are composed of 5 
clumps with mean separation 2. .7 ft" 1 Mpc. It is clear from 
our results that a minimum number of at least 200 data 
points is required to ensure a correct substructure recov- 
ery and that an increasing number of points improves the 
confidence of the analysis. 

4.4 Dependence on the interclumps separation 

Clusters are identified by the names C10 (mean separation 

1.5 ft" 1 Mpc), CI (2.7 ft" 1 Mpc), Cll (3.7ft" 1 Mpc) and 
C12 (4.7ft -1 Mpc). We note no significant variations in the 
number of substructures detected as the mean interclumps 
separation vary, as one can see from Table ^, if the mean 
separation is greater than 1.5 ft" 1 Mpc. 

Table 4. Tests on mock clusters: varying clump distances. Sym- 
bols as in Tabic 1. 
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Table 5. Tests on mock clusters: varying completeness. Symbols 
as in Table 1. 
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Table 6. Tests on mock catalogues: summary. Columns are as in 
Table 1. 
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4.5 Dependence on completeness 

Clusters are identified by the names C17 (100 % complete- 
ness at m B = 22), C18 (m B = 20), C19 (m B = 18) and C20 
(m B = 16). We note a progressive decrease in the number of 
structures found, as the magnitude of completeness decrease. 
However, it is worth noticing that, till the value m B = 16 
included, this decrease is not dramatic and the number of 
structures detected is very close to the effective number of 
structures composing the clusters. We can consider this re- 
sult as a warning to keep in mind while examining catalogues 
with low completeness. In this case the number of substruc- 
tures detected could be less than the effective number of 
substructures. 

4.6 Test on a N-body simulation 

In order to test the method on a real hierarchical struc- 
ture which has many possible substructures at many length- 
scales, we run a N-body cosmological simulation of a cubic 
box of present size L = 50ft" 1 Mpc with a number of parti- 
cles n = 16, 666.216. 

The purpose of the simulation is to form a configura- 
tion clustered on several scales. For this reason, we use con- 
strained initial conditions. 

The wavelet analysis is performed on these sets of points 
for three scales: 50, 100, 200 ft" 1 kpc. The confidence level 
of detection is 99.5% for all the scales and the thresholds 
have been computed from a random simulation in the same 
region with the same number of points. 

As one can see from the result in Table H our method 



© 0000 RAS, MNRAS 000, 000-000 



6 A. Pagliaro et al. 



detects quite a few structures on these scales. In particular 
on a scale of 500 h~ x kpc it detects 5 structures. We have 
analysed the same simulation using SKID (Stadel et al., in 
preparation), a tool used to detect gravitationally bound ob- 
jects, and found more than 50 objects when using the same 
percolation length scale. This apparently contradictory re- 
sult is not strange when one thinks that SKID looks for 
gravitationally bound objects, while our method inspects 
the local density field and looks for relatively isolated ob- 
jects. This cannot be the case of the output of purely col- 
lisionless N-body simulation, where gravitationally isolated 
bound groups are rare. In fact, each gravitationally bound 
group on a given scale hosts also non-gravitationally bound 
particles, which are often irregularly distributed across many 
different groups. 

This restriction of our method should not however cause 
problems for galaxy catalogues, because galaxies are rela- 
tively isolated objects even when they trace an hierarchically 
clustered density field. 

As a matter of fact, we already applied a serial version 
of the wavelet code on a catalogue of galaxies for the Coma 
clusters and the method has been able to detect hidden hi- 
erarchical substructures on different scales (see GPAB). 



5 STATISTICAL ROBUSTNESS 

Drawing from a wavelet analysis of a catalogue of a com- 
bined angular-redshift distribution any conclusion about the 
real phase- and configuration space structure requires that 
one verifies first that the catalogue does not suffer from any 
systematic selection biases or from other types of systematic 
effects like those induced by redshift distortions, as described 
by Regos & Geller (1989) and Praton & Schneider (1989). 
About the latter we notice that they have little significance 
for a distant cluster like Coma (GPAB), in which the Hubble 
flow term is dominant over the peculiar velocity within the 
substructures. These effects may on the other hand affect 
the analysis in closer clusters, but our work shows that in 
the presence of a significant statistics (that we quantify with 
about 800 galaxies members) these effects can be neglected 
too. One can reasonably argue that because the structures 
we find in simulated clusters of galaxies are generally well 
within the nonlinear virialized region, on these scales we are 
probing a region of the phase space detached from the Hub- 
ble flow, where the linearity between redshift and distance is 
completely lost. On the other hand one also expects that the 
phase-space distribution within the nonlinear region should 
be enough well-mixed within each clump (if there are any) 
that the substructures detected correspond to substructures 
in velocity space. 

In order to check this latter hypothesis we have re- 
peated the wavelet analysis in the distance dependence 
study (the distance is the parameter mostly affected from 
redshift effects) on each of 10 realizations obtained by ran- 
domly "reshuffling" the original catalogue, i.e. redistributing 
randomly the redshifts among the galaxies while keeping the 
angular coordinates fixed. 

The results are consistent with those found by EM who 
performed a similar analysis for 2-D catalogues. The average 
values of the number of structures is always smaller than the 



Table 7. Results of substructure recovery in totally reshuffled 
clusters. All lengths are expressed in h~ 1 Mpc; In columns 1 and 
2: Name of the simulated cluster and distance to the observer, 
respectively. Columns 3-4: number of substructures detected at 4 
scales in the range: 0.5 -r 4/i -1 Mpc. 
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4.3 ±0.7 


4.6 ±0.8 
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2.0 ±0.1 
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48 


13.0 ± 1.6 


6.8 ± 1.0 


4.6 ±0.8 
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C3 


28 


12.8 ± 1.8 


5.4 ± 1.2 


5.2 ±1.0 


2.0 ±0.1 


Cl 
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12.4 ± 1.2 


6.0 ± 1.2 


4.2 ±0.6 


2.0 ±0.0 



one found in the original catalogue, showing that the cata- 
logue itself is probably contaminated by some uncertainty, 
probably connected to the arbitrariness in the choice of the 
redshift limits, by some background contaminants, etc. 

Another kind od "reshuffling" has been performed on 
the same data varying the value of the redshift by a typical 
peculiar velocity in a random direction and following some 
target galaxies. These may leap from one substructure to an- 
other, but as the number of galaxies is high the morphology 
of substructure is not modified. 

These tests strengthen our confidence on the physical 
significance of most of the substructures that can be de- 
tected by means of this method, particularly when deal- 
ing with a great number of galaxies members or with fewer 
galaxies, but in a more distant cluster. 



6 CONCLUSIONS 

In the last years contradictory conclusions on various meth- 
ods to detect significant substructures in clusters of galaxies 
have been reported by seve ral authors (Fitchett fc Web - 
ster ( ]1997| ), W est et al. ( |l988; ), Dres sier fc Shectman ( |l988| ), 
Mellier et al. (|l988j), Slezak et al. (|l990|), EM, Escalera & 



Mazure ( |1992[ ), L94, Lega et al. ( |1994[ ), GPAB). Among the 
methods that have been tested and used for this purpose, we 
believe that the most powerful is the one based on wavelet 
transforms and in this paper we have investigated its depen- 
dence on various parameters that characterize a cluster of 
galaxies, like its distance from the observer or the number of 
clumps and/or of galaxies that makes it and on some selec- 
tion effects. According to our analysis the wavelet transforms 
method is a very powerful method to recover substructures 
inside clusters of galaxies, rather independently from the 

Table 9. Results of substructure recovery in a N-body test: Num- 
ber of substructures detected at scales 50, 100, 200 h^ 1 kpc with 
corresponding morphological parameters 

N-BODY TEST 

scale N.obj. morph. 



50 

100 

200 



0.70 
0.58 
0.12 
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many features that may vary in a cluster. The only serious 
limitation is due to the necessity of knowing a priori the 
scale on which the substructure is to be found. This is an 
intrinsic drawback of the method and no improvement in 
numerical codes based on this technique could make up for 
it. An interscale connectivity graph can be helpful to discri- 
mate among th e scal es at which a physical object show fea- 
tures (Bijaoui (1995), and private communication) but this 



technique is presently beyond the purpose of our analysis. 
Besides, a significant number of data points is required to 
perform an accurate analysis. We estimate about 200 galax- 
ies to be a good minimum number to allow a rather accurate 
recovery in a distant cluster. As the distance decreases, we 
need a larger number of galaxy members so that a statis- 
tical redistribution can compensate for redshift distortion. 
For the closest clusters we have taken into account (8h~ 
Mpc) 800 members should be enough. Obviously, the larger 
the number of data points, the more accurate the analy- 
sis. Finally, it is clear that this method can not give any 
kind of dynamical information on the clusters investigated 
and that a companion method, li ke hi gh resol ution N-body 
simulations (e.g. Becciani et al. (1996), (1997)), is required 
for a more complete and detailed study about evolutionary 
states. 
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